Computer and Modernization ›› 2013, Vol. 1 ›› Issue (9): 35-37,4.doi: 10.3969/j.issn.1006-2475.2013.09.008

• 人工智能 • Previous Articles     Next Articles

MapReduce-based Web Log Mining Preprocessing

MAO Yan-qi, PENG Pei-fu   

  1. School of Physics and Information Science, Hunan Normal University, Changsha 410081, China
  • Received:2013-03-25 Revised:1900-01-01 Online:2013-09-17 Published:2013-09-17

Abstract: This article describes the general process of Web log mining and focuses on the study of Web session division in the preprocessing of Web log mining. The article introduces the concept and advantages of cloud computing. For the bottleneck analysis of Web log mining, as well as the difficulties of data storage and sharing, we proposed MapReduce-based Web log mining preprocessing, which can better solve the efficiency issues currently faced in Web log mining and better integrate the computer resources to reduce unnecessary waste.

Key words: Web log mining, MapReduce, session division

CLC Number: